High Performance Natural Language Processing on Semantic Network Array Processor
نویسندگان
چکیده
This paper describes a natural language processing system developed for the Semantic Network Array Processor (SNAP). The goal of our work is to develop a scalable and high-performance natural language processing system which utilizes the high degree of parallelism provided by the SNAP machine. We have implemented an experimental machine translation system as a central part of a real-time speech-to-speech dialogue translation system. It is a SNAP version of the DMDIAIOG speech-to-speech translation system. Memory-based natural language processing and syntactic constraint network model has been incorporated using parallel marker-passing which is directly supported from hardware level. Experimental results demonstrate that the parsing of a sentence is done in the order of milliseconds. 1 Introduction In this paper, we will demonstrate that natural language processing speeds in the order of milliseconds is attainable by using a marker-propagation algorithm and a specialized parallel hardware. The significance of the high-performance (or real-time) natural language processing is well known. Parsing sentences at the milliseconds speeds enables the realization of a speech recognition module capable of real-time speech understanding which eventually leads to the real-time simultaneous interpretation system. Also, the millisecond order performance enables the system to parse hundreds of sentences in a second , or over 3 million sentences per hour. This in turn makes possible bulk processing of text such as full-text retrieval, summarization, classification, translation, indexing and tagging. In order to accomplish the high-performance natural language processing, we have designed a highly parallel machine called Semantic Network Array Processor (SNAP) [Moldovan and Lee, 1990] [Lee and Moldovan, 19901, and implemented an experimental machine translation system called DMSN AP using a parallel marker-passing scheme. DM-SNAP is a SNAP implementation of the DMDIALOG speech-to-speech dialogue translation system [Kitano, 1990a) [Ki-tano, 1991a], but with some modifications to meet hardware constraints. Despite its high performance, our system carries out sound syntactic and se) antic analysis including lexical ambiguity, structural ambiguity, pronoun reference, control, unbounded dependency, and others. In the next section, we describe briefly the SNAP architecture , then, describe design philosophy behind the DMSNAP followed by descriptions on implementation and linguistic processing. Finally, performance are presented. 2 SNAP Architecture The Semantic Network Array Processor (SNAP) is a highly parallel array processor fully optimized for semantic network processing with marker-passing mechanism. In order to facilitate efficient propagation of markers and to case development of applications, a set of marker propagation instructions has been microcoded. SNAP supports …
منابع مشابه
Utility of Massively Parallel Computing Platform in Natural Language Processing*
This paper demonstrates the utility of the Semantic Network Array Processor (SNAP) as a massively parallel platform for high performance and large-scale natural language processing systems. SNAP is an experimental massively parallel machine which is dedicated to, but not limited to, the natural language processing using semantic networks. In designing the SNAP, we have investigated various natu...
متن کاملToward High Performance Machine Translation: Preliminary Results from Massively Parallel Memory-Based Translation on SNAP*
This paper describes a memory-based machine translation system developed for the Semantic Network Array Processor (SNAP). The goal of our work is to develop a scalable and high-performance memory-based machine translation system which utilizes the high degree of parallelism provided by the SNAP machine. We have implemented an experimental machine translation system DMSNAP as a central part of a...
متن کاملSemantic Network Array Processor as a Massively Parallel Computing Platform for High Performance and Large-Scale Natural Language Processing
This paper demonstrates the utility of the Semantic Network Array Processor (SNAP) as a massively parallel platform for high performance and large-scale natural language processing systems. SNAP is an experimental massively parallel machine which is dedicated to, but not limited to, the natural language processing ushag semantic networks. In designing the SNAP, we have investigated various natu...
متن کاملViewpoint-Based Similarity Discernment on SNAP
This paper presents an algorithm for viewpointbased similarity discernment of linguistic concepts on Semantic Network Array Processor (SNAP). The viewpoint-based similarity discernment plays a key role in retrieving similar propositions. This is useful for advanced knowledge processing areas such as analogical reasoning and case-based reasoning. The algorithm assumes that a knowledge base is co...
متن کاملA Representation for Natural Category Systems
Most AI systems model and represent natural concepts and categories using uniform taxonomies, in which no level in the taxonomy is distinguished. We present a representation of natural taxonomies based on the theory that human category systems are non-uniform. There is a basic level which forms the core of a taxonomy; both higher and lower levels of abstraction are less important and less usefu...
متن کامل